Method to predict the pattern of locomotion in horses

ABSTRACT

The present invention provides methods for predicting the pattern of locomotion in a horse including the ability of a horse to use different gaits and the ability to trot at a fast speed. The methods comprise determining in a sample of DNA obtained from a horse the presence or absence of at least one genetic marker, wherein said at least one genetic marker is located on horse chromosome 23, said marker being associated with the ability to use different gaits. The invention further provides primers that amplify markers being associated with the ability to use different gaits and hybridization probes to detect markers being associated with the ability to use different gaits and the ability to trot at a fast speed.

FIELD OF INVENTION

The present invention relates to methods for predicting the pattern oflocomotion in horses including the ability of a horse to use differentgaits and the ability to trot or pace at a fast speed. The methodscomprise determining in a sample of DNA obtained from a horse the alleleof at least one genetic marker, wherein said at least one genetic markeris located on horse chromosome 23, said marker being associated with theability to use different gaits.

BACKGROUND

Horses show a considerable variation in their pattern of locomotion bothwithin and between breeds. The three basic gaits in horses are walk,trot and gallop. The horses use these different gaits according to theirspeed, walk is used at slow speed, trot is a faster mode of locomotionand gallop is the gait horses normally use to run fast. However, somehorses have the ability to also use alternative gaits, for example paceand toelt, and such horses are called gaited horses. A horse that pacemoves the two legs on the same side in a lateral movement in contrast toa trotting horse that makes a diagonal movement where the diagonal frontand hind legs move forward and backwards together. Furthermore,Icelandic horses are able to perform a fifth gait named toelt, which isa four beet gait with the same foot fall pattern as the walk. Acharacteristic feature of toelt is that the horse then always has atleast one hoof touching the ground, giving a very smooth gait. Examplesof other similar alternative gaits, also known as ambling gaits, are foxtrot, the rack, running walk and paso cort. The alternative gaits varyin footfall pattern, timing, and cadence, and can be generally dividedinto four categories: pace, regular rhythm ambling, lateral ambling anddiagonal ambling. Table 1 provides a classification of breeds as gaitedor non-gaited horses. Most horse breeds are in fact non-gaited and onlyrepresentative examples of such breeds are listed in the table. Horsesrepresenting breeds classified as non-gaited never or rarely are able toperform the alternative gaits whereas most or all horses from the gaitedbreeds can perform alternative gaits. There are more gaited breedsworldwide in addition to the ones listed in table 1. Sometimes, there isa considerable variation also within breeds as regards the pattern oflocomotion. For instance, Icelandic horses are classified as four-gaitedor five-gaited, where the former can perform walk, trot, gallop andtoelt whereas the latter can also pace.

The Standardbred horse, used for harness racing has a unique ability totrot or pace at a very fast speed without falling into gallop which isthe normal gait at high speed for a horse. In North America, asubpopulation of Standardbred horses that pace at very high speed hasbeen developed. Other horse breeds used for harness racing includesbreeds like the Cold-blooded trotter, Finnhorses, the Frensch trotterand the Orlove trotter.

The pattern of locomotion in horses is under strong selection in horsebreeding. For instance, the ability to race using gallop, trot and paceare selected in Thoroughbred horses, Standardbred trotters andStandardbred pacers, respectively. Horses with the ability to usealternative gaits are also highly desired by some riders and is a traitupon which many specialized breeds have been developed. Methods forpredicting the pattern of locomotion in a horse, i.e. its ability to usedifferent gaits, would therefore have a great utility in the horsebreeding industry.

BRIEF DESCRIPTION OF INVENTION

The present inventors have identified a genetic locus in horses thatdetermines the horse's ability to use different gaits and the ability totrot at a fast speed. A premature stop codon in the gene for thedoublesex and mab-3 related transcription factor 3 (DMRT3) was found inall tested horses with the ability to perform alternative gaits. Mutanthorses express a truncated DMRT3 protein which lacks the last 174 aminoacid residues but maintains a functional DNA-binding domain. DMRT3 isexpressed in a subset of neurons in the spinal cord of the horse.

Accordingly the present invention provides methods for predicting thepattern of locomotion in horses including the ability of a horse to usedifferent gaits, the ability to trot or pace at a fast speed, and theability to perform in dressage.

A first aspect of the invention provides methods for predicting thepattern of locomotion in horses including the ability of a horse to usealternative gaits, the ability to trot at a fast speed, and the abilityto perform in dressage which comprise extracting protein from a sampleobtained from a horse. The methods further comprise determining in saidprotein sample the presence or absence of a truncated form of the DMRT3protein. The DMRT3 protein can be a DMRT3 protein truncated at aminoacid position 300 corresponding to the protein SEQ ID NO: 4. Thedetermination can be made by use of an immunochemical method, such asWestern blot, using an anti DMRT3 antibody.

A second aspect of the invention provides methods for predicting thepattern of locomotion in horses including the ability of a horse to usealternative gaits, the ability to trot at a fast speed, and the abilityto perform in dressage which comprise extracting DNA from a sampleobtained from a horse. The methods further comprise determining in saidDNA the allele of at least one genetic marker, wherein said at least onegenetic marker is located in the region between the flanking SNPs atnucleotide positions 22,628,976 (corresponding to position 51 in SEQ IDNO: 6) and 23,315,071 (corresponding to position 51 in SEQ ID NO: 7) onhorse chromosome 23.

The genetic marker can be selected from single nucleotide polymorphisms(SNPs) and insertion/deletions (INDELs).

Preferably, the genetic marker is selected from the genetic markerslisted in Tables 4, 5, 7 and 8.

Preferably the genetic marker is located in the region between theflanking SNPs at nucleotide positions 22,919,878 and 23,011,289 on horsechromosome 23.

Preferably, the genetic marker is selected from the genetic markerslisted in Table 8.

Most preferably the genetic marker is located at position 22,999,655 onhorse chromosome 23, corresponding to position 939 in SEQ ID NO:1.

More specifically, the methods can comprise identifying in said DNA thenucleotide in one or more specific position(s) selected from thepositions 22,919,878; 22,920,361; 22,920,434; 22,920,646; 22,920,717;22,921,203; 22,922,079; 22,922,780; 22,923,569; 22,924,120; 22,924,142;22,924,299; 22,924,380; 22,924,407; 22,926,098; 22,926,188; 22,926,872;22,927,387; 22,927,607; 22,928,220; 22,928,537; 22,928,587; 22,929,137;22,930,011; 22,932,024; 22,932,895; 22,933,218; 22,936,034; 22,940,759;22,942,423; 22,945,643; 22,946,599; 22,948,774; 22,949,055; 22,949,108;22,949,240; 22,949,710; 22,956,846; 22,960,132; 22,960,528; 22,960,710;22,964,042; 22,965,059; 22,967,119; 22,967,656; 22,967,915; 22,968,898;22,973,984; 22,974,589; 22,979,124; 22,980,014; 22,982,879; 22,984,588;22,985,746; 22,988,210; 22,988,991; 22,993,092; 22,994,591; 22,999,058;22,999,655; 23,002,606; 23,003,956; 23,008,772; 23,008,789; 23,009,648;23,010,164; and 23,011,289, on horse chromosome 23.

Most preferably the methods comprise identifying in said DNA thenucleotide in the specific position 22,999,655 on horse chromosome 23.

More specifically, the methods can comprise determining in said DNA thepresence or absence of:

-   -   i) the nucleotide C in a nucleotide position corresponding to        position 939 in SEQ ID NO: 1,    -   ii) the nucleotide A in a nucleotide position corresponding to        position 939 in SEQ ID NO: 3,    -   iii) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 5,    -   iv) the nucleotide A and/or G in a nucleotide position        corresponding to position 51 in SEQ ID NO: 6,    -   v) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 7,    -   vi) the nucleotide G and/or C in a nucleotide position        corresponding to position 51 in SEQ ID NO: 8,    -   vii) the nucleotide A and/or G in a nucleotide position        corresponding to position 51 in SEQ ID NO: 9,    -   viii) the nucleotide T and/or G in a nucleotide position        corresponding to position 51 in SEQ ID NO: 10,    -   ix) the nucleotide T and/or C in a nucleotide position        corresponding to position 51 in SEQ ID NO: 11,    -   x) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 12,    -   xi) the nucleotide A and/or G in a nucleotide position        corresponding to position 51 in SEQ ID NO: 13,    -   xii) the nucleotide A and/or C in a nucleotide position        corresponding to position 51 in SEQ ID NO: 14    -   xiii) the nucleotide G and/or C in a nucleotide position        corresponding to position 51 in SEQ ID NO: 15,    -   xiv) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 16,    -   xv) the nucleotide G and/or A in a nucleotide position        corresponding to position 51 in SEQ ID NO: 17,    -   xvi) the nucleotide G and/or C in a nucleotide position        corresponding to position 51 in SEQ ID NO: 18,    -   xvii) the nucleotide C and/or A in a nucleotide position        corresponding to position 51 in SEQ ID NO: 19,    -   xviii) the nucleotide T and/or C in a nucleotide position        corresponding to position 51 in SEQ ID NO: 20,    -   xix) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 21,    -   xx) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 22,    -   xxi) the nucleotide C and/or A in a nucleotide position        corresponding to position 51 in SEQ ID NO: 23,    -   xxii) the nucleotide C and/or G in a nucleotide position        corresponding to position 51 in SEQ ID NO: 24,    -   xxiii) the nucleotide A and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 25,

Preferably the methods comprise determining in said DNA the presence orabsence of:

-   -   i) the nucleotide C in a nucleotide position corresponding to        position 939 in SEQ ID NO: 1,    -   ii) the nucleotide A in a nucleotide position corresponding to        position 939 in SEQ ID NO: 3,    -   iii) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 5,    -   iv) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 7,    -   v) the nucleotide T and/or C in a nucleotide position        corresponding to position 51 in SEQ ID NO: 20,    -   vi) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 21,    -   vii) the nucleotide C and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 22,    -   viii) the nucleotide C and/or A in a nucleotide position        corresponding to position 51 in SEQ ID NO: 23,    -   ix) the nucleotide C and/or G in a nucleotide position        corresponding to position 51 in SEQ ID NO: 24,    -   x) the nucleotide A and/or T in a nucleotide position        corresponding to position 51 in SEQ ID NO: 25,

Most preferably the methods comprise determining in said DNA thepresence or absence of:

-   -   i) the nucleotide C in a nucleotide position corresponding to        position 939 in SEQ ID NO: 1 or    -   ii) the nucleotide A in a nucleotide position corresponding to        position 939 in SEQ ID NO: 3.

The horse can be selected from any horse or breed of horses belonging tothe species Equus caballus. Examples of horse breeds can be found inTable 1.

TABLE 1 Classification of horse breeds as gaited or non-gaited, wheregaited horses have the ability to perform alternative gaits in additionto the three basic gaits walk, trot and gallop. Breed ClassificationAmerican Saddlebred gaited Campolina gaited Icelandic horse gaitedKentucky Mountain Saddle Horse gaited Mangalarga Marchador gaitedMarwari horse gaited Missouri Foxtrotter gaited Paso Fino gaitedPeruvian Paso gaited Racking horse gaited Rocky Mountain Horse gaitedSpotted Saddle horse gaited Standardbred* gaited Tennessee Walker gaitedWalkaloosa gaited Akhal teke non-gaited American Paint Horse non-gaitedAndalusian non-gaited Arabian non-gaited Belgian non-gaited Dolenon-gaited Exmoor Pony non-gaited Friesian non-gaited Haflingernon-gaited Hanoverian non-gaited Lusitano non-gaited North Swedish Drafthorse non-gaited Norwegian Fjord non-gaited Quarter Horse non-gaitedSelle Français non-gaited Shetland Pony non-gaited Suffolk Punchnon-gaited Thoroughbred non-gaited Trakehner non-gaited *Two separatepopulations, pacers and trotters, many trotters seem to be able totoelt.

According to one aspect of the invention the methods according to thepresent invention can be used for paternity testing of horses.

It is contemplated that any method or composition described herein canbe implemented with respect to any other method or composition describedherein. The use of the word “a” or “an” when used in conjunction withthe term “comprising” in the claims and/or the specification may mean“one,” but it is also consistent with the meaning of “one or more,” “atleast one,” and “one or more than one.” These, and other, embodiments ofthe invention will be better appreciated and understood when consideredin conjunction with the following description and the accompanyingdrawings. It should be understood, however, that the followingdescription, while indicating various embodiments of the invention andnumerous specific details thereof, is given by way of illustration andnot of limitation. Many substitutions, modifications, additions and/orrearrangements may be made within the scope of the invention withoutdeparting from the spirit thereof, and the invention includes all suchsubstitutions, modifications, additions and/or rearrangements.

LEGENDS TO FIGURES

The following drawings form part of the present specification and areincluded to further demonstrate certain aspects of the presentinvention. The invention may be better understood by reference to one ormore of these drawings in combination with the detailed description ofspecific embodiments presented herein.

FIG. 1. Results of genome-wide analysis of 70 Icelandic horsesclassified as four-gaited or five-gaited. The highly associated SNP atnucleotide position Chr23: 22,967,656 base pairs is marked by an arrow.

FIG. 2. Genomic region harboring the Gait locus on chromosome 23controlling the pattern of locomotion in horses. The DMRT3 gene is notproperly annotated in this assembly but it is represented by the Ensembltranscript ENSECAT00000025062 indicated by an arrow in this figure. Thefigure is adapted from an output from the UCSC genome browser(www.genome.ucsc.edu).

FIG. 3. Nucleotide and amino acid alignment for codon 296 to 306 ofhorse DMRT3 including codon 301 in which a nonsense mutation occurs inthe allele associated with the ability to pace.

FIG. 4. Alignment of amino acids 249 to 331 (numbered according to thehorse sequence) in the DMRT3 protein from different vertebrate speciesincluding the wild-type (WT) and mutant (MUT) form of the horse DRMT3protein. “.” indicates gap in the alignment; “-” indicates identity tothe master sequence used (cattle); * indicates the nonsense mutation atcodon 301 in the horse mutant allele.

FIG. 5. Schematic presentation of the predicted wild-type and mutant(gait) forms of the DMRT3 protein in horses. DM=zinc-finger like DNAbinding module. DMA=protein domain of unknown function present in DMRTproteins.

FIG. 6. EMSA using an oligonucleotide representing a DMRT3-binding motifand in vitro-translated myc-tagged DMRT3 wild-type and mutant proteins.

Super-shifts were demonstrated using an anti-myc antibody (thatrecognizes both forms) or with an anti-DMRT3 antibody that recognizesthe C-terminal part of the wild-type protein, but not the truncatedform. An oligonucleotide corresponding to a DMRT1-binding site was alsoused and gave similar results (data not shown). The cold competingoligonucleotide was added in 150× excess. GS=gel-shift representingcomplex between DMRT3 protein and oligonucleotide; SS=super-shiftrepresenting complex between antibody, DMRT3 protein andoligonucleotide.

DETAILED DESCRIPTION OF THE INVENTION

The present inventors have demonstrated that there is a locus, herenamed Gait, on horse chromosome 23 that has a major impact on thepattern of locomotion in horses. The present results show thathomozygosity for a recessive allele at this locus is required for theability of a horse to pace. It is postulated that the nonsense mutationat nucleotide position 22,999,655 in exon 2 of the DMRT3 gene is thecausative mutation for the Gait allele. DMRT3 is a highly conserved genepresent in all vertebrates studied so far. The function of the DMRT3protein has not been established by any previous studies but the factthat it is expressed in brain and in the spinal cord of the mouse (MGI,www.informatics.jax.org) is consistent with a critical role forcontrolling locomotion as demonstrated by the present study. Thenonsense mutation underlying the gait allele may very well have aphenotypic effect in the heterozygous condition since it occurs in thelast exon of DMRT3 and is expected to encode a truncated form of theprotein (SEQ ID NO:4) that lacks the last 174 amino acids (FIG. 5). TheDNA binding DM domain of DMRT3 is located in the N-terminal part that ismaintained in the truncated form (FIG. 5). The mutant form of DMRT3 maytherefore be able to bind to its target DNA sequences but may showdefects as regards the interaction with other proteins required for itsnormal function and may therefore has a dominant-negative effect inheterozygotes. It is worth noticing that only one of the Icelandichorses was homozygous for the wild-type (non-pace) allele at the Gaitlocus.

This study has established a genetic marker that can be used to predictthe genetic constitution of a horse as regards its pattern oflocomotion. We predict that the gait allele is present in most, if notall, gaited breeds some of which are listed as gaited in Table 1 and itmay occur at a low frequency in other breeds as well. The marker alsopredicts a horse capacity to trot or pace at a high speed as it is foundat a high frequency in horses used for harness racing. Further, wepredict that horses with at least one wild-type allel are better atshowjumping, traditional dressage, and completion racing in gallop.

The pattern of locomotion determines the ability of a horse to usealternative gaits, as well as the horse's ability to trot or pace at afast speed, and its ability to perform in dressage. Alternative gaitsinclude, pace, and the ambling gaits exemplified by toelt, running walk,rack, classic fino, paso corto, paso largo, paso ilano, sobreandando,fox trot.

A horse being homozygous or heterozygous for the gait allele can bepredicted to have the ability to use alternative gaits and to trot athigh speed. A horse being homozygous or heterozygote for the wild typeallele can be predicted to have better ability to perform inshowjumping, dressage, and completion racing in gallop.

The utility of this invention in the horse breeding industry includesthe determination of the genotype of potential breeding animals tomaximise the chance to obtain a progeny with a favoured pattern oflocomotion. The information about the genotype at the DMRT3 locus mayalso be used by sellers and buyers of horses to predict the ability ofthe horse to perform different gaits. Furthermore, the methods accordingto the invention can be used to effectively introgress the gait alleleinto non-gaited breeds.

According to one aspect of the invention the methods according to thepresent invention can be used for selecting horses for breeding.

Accordingly, one aspect of the invention provides methods for selectinga horse for breeding, said methods comprising determining in a DNAsample obtained from said horse the allele of at least one geneticmarker, wherein said at least one genetic marker is located in theregion between the flanking SNPs at nucleotide positions 22,628,976 onhorse chromosome 23. The genetic marker can be selected from singlenucleotide polymorphisms (SNPs) and insertion/deletions (INDELs).

Preferably, the genetic marker is selected from the genetic markerslisted in Tables 4, 5, 7 and 8.

Preferably the genetic marker is located in the region between theflanking SNPs at nucleotide positions 22,919,878 and 23,011,289 on horsechromosome 23.

Preferably, the genetic marker is selected from the genetic markerslisted in Table 8.

Most preferably the genetic marker is located at position 22,999,655 onhorse chromosome 23, corresponding to position 939 in SEQ ID NO:1.

The most reliable test for determining the genotype at the Gait locus isto determine the presence and/or absence of the nonsense mutation inexon 2 of DMRT3 (nucleotide position 22,999,655 on chromosome 23,corresponding to nucleotide position 939 in SEQ ID NO:3). However,genetic markers located in the interval between the flanking markers atnucleotide positions 22,628,976 and 23,315,071, and more specificallygenetic markers located in the interval between positions 22,919,878 and23,011,289, exemplified by the markers listed in Table 8, show a more orless strong association to the genotype for the causative SNP atnucleotide position 22,999,655 due to the presence of linkagedisequilibrium in the region. Accordingly, one or more of these markers,individually or in combination, can be used to determine the genotype atthe Gait locus, and can consequently as well be used in the methodsaccording to the present invention.

The term “sample” or “biological sample” according to the presentinvention refers to any material containing nucleated cells from saidhorse to be tested. In a preferred embodiment the biological sample tobe used in the methods of the present invention is selected from thegroup consisting of blood, sperm, hair roots, milk, body fluids as wellas tissues including nucleated cells.

DNA extraction, isolation and purification methods are well-known in theart and can be applied in the present invention. Standard protocols forthe isolation of genomic DNA are inter alia referred to in Sambrook, J.,Russell. D. W. Molecular Cloning: A Laboratory Manual, the thirdedition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor. NewYork, 1.31-1.38, 2001 and Sharma. R. C., et al. “A rapid procedure forisolation of RNA-free genomic DNA from mammalian cells”, BioTechniques,14. 176-178. 1993.

According to the present invention the term “SNP” refers to a singlenucleotide polymorphism at a particular position in the horse genomethat varies among a population of individuals. SNPs can be identified bytheir location within the disclosed particular sequence, i.e. within theinterval of 22,628,976 and 23,315,071 base pairs on horse chromosome 23or their name as shown in Tables 4, 5, 7 and 8. SNPs identified as beinguseful for predicting the ability of a horse to use different gaitsaccording to the present invention are shown in Tables 4, 5, 7 and 8.For example, the SNP BIEC2-620109 of Table 5 indicates that thenucleotide base (or the allele) at nucleotide position 22,967,656 onchromosome 23 of the reference sequence as referred to herein may beeither Cytosine (C) or Thymidine (T). The allele associated with orindicative for a horse able to use five gaits is in the case of SNPBIEC2-620109 of Table 5 Thymidine (T).

The term “determining in said DNA the allele of at least one geneticmarker” in accordance with the present invention refers to a method fordetermining or identifying whether a particular nucleotide sequence ispresent in a DNA sample.

The term “identifying in said DNA the nucleotide in one or more specificposition on the horse chromosome 23” refers to a method for determiningthe identity of the nucleotide in said specific position on the horsechromosome 23, i.e. to determine whether the nucleotide in said specificposition is Adenosine (A), Cytosine (C), Guanosine (G), or Thymidine(T).

There are several methods known by those skilled in the art fordetermining whether a particular nucleotide sequence is present in a DNAsample and for identifying the nucleotide in a specific position in aDNA sequence. These include the amplification of a DNA segmentencompassing the genetic marker by means of the polymerase chainreaction (PCR) or any other amplification method, interrogate thegenetic marker by means of allele specific hybridization, the3′exonuclease assay (Taqman assay), fluorescent dye and quenchingagent-based PCR assay, the use of allele-specific restriction enzymes(RFLP-based techniques), direct sequencing, the oligonucleotide ligationassay (OLA), pyrosequencing, the invader assay, minisequencing,DHPLC-based techniques, single strand conformational polymorphism(SSCP), allele-specific PCR, denaturating gradient gel electrophoresis(DGGE), temperature gradient gel electrophoresis (TGGE), chemicalmismatch cleavage (CMC), heteroduplex analysis based system, techniquesbased on mass spectroscopy, invasive cleavage assay, polymorphism ratiosequencing (PRS), microarrays, a rolling circle extension assay,HPLC-based techniques, extension based assays, ARMS (AmplificationRefractory Mutation System), ALEX (Amplification Refractory MutationLinear Extension), SBCE (Single base chain extension), molecular beaconassays, invader (Third wave technologies), ligase chain reaction assays,5′-nuclease assay-based techniques, hybridization capillary arrayelectrophoresis (CAE), protein truncation assays (PTT), immunoassays,and solid phase hybridization (dot blot, reverse dot blot, chips). Thislist of methods is not meant to be exclusive, but just to illustrate thediversity of available methods. Some of these methods can be performedin accordance with the methods of the present invention in microarrayformat (microchips) or on beads.

The invention thus also relates to the use of primers or primer pairs,wherein the primers or primer pairs hybridize(s) under stringentconditions to the DNA comprising the interval between nucleotidepositions 22,628,976 and 23,315,071, preferably between positions22,919,878 and 23,011,289, base pairs on horse chromosome 23, or to thecomplementary strand thereof.

Preferably the primers or primer pairs hybridize(s) under stringentconditions to the sequences SEQ ID NO: 1, 3 and 5 to 25.

Preferably, the primers of the invention have a length of at least 14nucleotides such as 17 or 21 nucleotides.

More specifically the primers can be selected from SEQ NO:26, SEQ IDNO:27, SEQ ID NO:30, and SEQ ID NO:31.

In one embodiment, the primers actually binds to the position of theSNPs as referred to in Tables 4, 5, 7 and 8. Such an allele specificoligonucleotide in accordance with the present invention is typically anoligonucleotide of at least 14 to 21 nucleotide bases in length designedto detect a difference of a single base in the target's genetic sequenceof the horse to be tested. In accordance with the present invention oneor more specific primers can be applied in order to identify more than asingle SNP as referred to herein. As a consequence, when binding isperformed under stringent conditions, such primer or such primers is/areuseful to distinguish between different polymorphic variants as bindingonly occurs if the sequences of the primer and the target have fullcomplementarily. It is further preferred that the primers have a maximumlength of 24 nucleotides. Such primers can be coupled with anappropriate detection method such as an elongation reaction or anamplification reaction, which may be used to differentiate between thepolymorphic variants and then draw conclusions with regard to the horseas regards its ability to use different gaits.

Hybridisation is preferably performed under stringent or highlystringent conditions. “Stringent or highly stringent conditions” ofhybridization are well known to or can be established by the personskilled in the art according to conventional protocols. Appropriatestringent conditions for each sequence may be established on the basisof well-known parameters such as temperature, composition of the nucleicacid molecules, salt conditions etc.: see, for example, Sambrook et al.“Molecular Cloning, A Laboratory Manual”, CSH Press, Cold Spring Harbor,1989 or Higgins and Hames (eds.), “Nucleic acid hybridization, apractical approach”, IRL Press, Oxford 1985, see in particular thechapter “Hybridization Strategy” by Britten & Davidson. Typical (highlystringent) conditions comprise hybridization at 65° C. in 0.5×SSC and0.1% SDS or hybridization at 42° C. in 50% formamide, 4×SSC and 0.1%SDS. Hybridization is usually followed by washing to remove unspecificsignals. Washing conditions include conditions such as 65° C., 0.2×SSCand 0.1% SDS or 2×SSC and 0.1% SDS or 0.3×SSC and 0.1% SDS at 25° C.-65°C.

The term “nucleotide positions 22,628,976 and 23,315,071 base pairs onhorse chromosome 23” and other similar denoted nucleotide positionsrefer to the horse reference sequence according to the September 2007Equus caballus draft assembly EquCab2 (UCSC version equCab2). EquCab2was produced by The Broad Institute. EquCab2 is available at thewww.genome.ucsc.edu genome browser.

Examples

A genome-wide screen for genes affecting pattern of locomotion using thehorse SNP chip comprising assays for 54,602 single nucleotidepolymorphisms in the horse genome (Illumina EquineSNP50 BeadChip;http://www.illumina.com/products/equine_snp50_whole_genome_genotyping_kits.ilmn)was performed. A population material comprising 70 Icelandic horses inwhich 30 were classified as four-gaited and 40 were classified asfive-gaited, i.e. only the latter had a documented ability to pace, wasused in the assay.

Animal Material.

Blood samples were collected from 70 Icelandic horses from Sweden.Genomic DNA was prepared from all horses using QIAamp DNA Blood Midi Kit(Qiagen). The owners of the horses were asked to classify their horsesas four-gaited or five-gaited. Hair samples were collected from 61Swedish Standardbred horses and 2 North-Swedish Trotter. DNA from sixhair roots was extracted by adding 97 μl Chelex solution and 7 μlProteinas K and incubated in 56° C. for 60 minutes followed by anincubation in 95° C. for 10 minutes.

Genome-Wide Analysis (GWA).

The GWA was performed using the Illumina EquineSNP50 BeadChip(http://www.illumina.com/products/equine_snp50_whole_genome_genotyping_kits.ilmn).The statistical analysis of the data was carried out using the softwarePLINK (Purcell et al. 2007. PLINK: a tool set for whole-genomeassociation and population-based linkage analyses. Am. J. Hum. Genet.81:559-575).

DNA Sequencing.

A number of coding and non-coding regions located between the flankingSNPs at nucleotide positions 22,628,976 and 23,315,071 base pairs onhorse chromosome 23 was PCR amplified and sequenced to identify sequencepolymorphisms. All primers used for these experiments are listed inTable 2. The amplicons were amplified with standard PCR conditions and(2720 Thermal Cycler, Applied Biosystems, Foster City, Calif.). StandardSanger sequencing was performed using an AB3730 capillary sequencer(Applied Biosystems, Foster City, Calif.).

TABLE 2 Primers used for PCR amplification and sequencing of selected regions in horses Amplified region Amplicon Nucleotide positionsForward primer Reverse primer ANKRD15exon1.1 chr23:22792627-22793280TCATACCAGCTTGCCACACT GAGGAGAGAGAGCTCGTGGA ANKRD15exon1.2chr23:22793162-22793792 CTAATGGAGACCCGCAGAAG GCCGGAACTCCTTTATCCTCANKRD15exon1.3 chr23:22793704-22794386 GAGAAGTGGCGGGGAATTATGCCCCACGACTTTATTCTCA ANKRD15exon1.4 chr23:22794261-22794946TGCAGACGAGAGACCAAATG AAACCCAGAAGTGCCTGAGA ANKRD15exon1.5chr23:22794844-22795453 GCGGACAGTGGCTATAGGAG AATACATTGTCCCCACCCTTCANKRD15exon2 chr23:22807940-22808575 ATGGGATTTGAGCTGAGTGGAAGCCTGATGCTGAGAAGGA ANKRD15exon3 chr23:22809005-22809616TTGCATGCACACAATTTTCC CTGGGGGTTTCTGAGTTCTG ANKRD15exon4chr23:22810246-22810904 GCAACCCAGGTTATCCCTTT TCACCTTCTGCACTTGCATTANKRD15exon5 chr23:22812005-22812621 AAGTCGACTGAGGGGCTCTTACCTTGGCCCAGATAGGTTT ANKRD15exon6 chr23:22815102-22815741TCCCCAGGAACATACAGCTC TGGAAAGGATTTGAGGATGC ANKRD15exon7chr23:22817755-22818429 GCTTCTGGCCTCACGAAATA TGGCATGAAGACACCACAATANKRD15exon8 chr23:22818653-22819254 AGCCCCAGTACAGACCACACGGGAAGTCGCCTACACTGAA ANKRD15exon9 chr23:22820739-22821346GAGGATCCGTGGGATACAGA AGCAAGTCTCCTGAGCAAGC ANKRD15exon10chr23:22821626-22822233 CAGAGGACACATCTGCCTGA CAAAACCATCCTGGAAATGGANKRD_GAP chr23:22836558-22837273 GTCCATCCCCTTCTCTCCTCTGTCAGCTGCAGAATGGAAG PRIMER_DS7 chr23:22851938-22852292AGACTGGCCCTGAGCTAACA CTGAAGGTGCCCTCTACAGC PRIMER_DS5chr23:22868140-22868803 TTACCTGCCCCTTTGTTTTG CATCTTTGCCCCTCAGACTCPRIMER_DS2 chr23:22869516-22870124 TTACGTGGCACCCCTACTTCAGCCTGGACTCTGTCCTTGA PRIMER_DS1 chr23:22872699-22873368TGCTGCCCTCTGTCTATGTG AAAGTAACGATGCGGTGGAC PRIMER_DS4chr23:22874773-22875445 AAATGGCTGTGCCGTTTTAC CTGTGTGACCAAGCTCTCCAPRIMER_DS3 chr23:22876084-22876784 GAAAATGCTGACGTGCTGAACTTGCTGCCTTTTGCCTATC PRIMER_DS6 chr23:22876563-22877255GCAGAGCGACCTGGAGATAG GGCCTTAGAGGGACACATGA BIETOP-620109B_3chr23:22967269-22967902 CCTCTCACCCAGACACCATT AGTTGGCAAACAACAGGACABIETOP-620109D_2 chr23:22967525-22968019 AAGTCCTTTCTTGGGGGCTAGGTCCATCGTTGACCAAAAT BIETOP-620109C_2 chr23:22967526-22968041AGTCCTTTCTTGGGGGCTAA ACGGCACCACCATCATCTAT DMRT3exon0chr23:22985884-22986463 GCCCCAACTTAAGACCCTCT CCGCGCTGCTTAGGAGTCDMRT3exon0B chr23:22985884-22987295 GCCCCAACTTAAGACCCTCTTACCTGGCTTGTCGAGCTG DMRT3GAP chr23:22986413-22987358GAGCACGCTCAGACCCTATC AAAGAGCTCCGAAGTTTTTGC DMRT3exon2.1chr23:22999117-22999797 CTCCTTCCAAGAAGCCTGTG AGAGTCTGCGGAAAACCTCADMRT3exon2.2 chr23:22999709-23000396 CCTTGAGCTCATACCCCATCACTAAAGCCGCAGAGCAGAG DMRT3exon2.3 chr23:23000251-23001049GAGAGGCCTCGTCCTGTGTA TCCCACTCACATTTCCCAAT PRIMER_1chr23:23009567-23010210 CAAGGGCATGAGGAGTGTTT ACTCCATGATTGCACAACGAPRIMER_2 chr23:23027620-23028300 TCATTCCACCAGCAATGTGTGGCCACTGCAGAAGAAAGAG PRIMER_3 chr23:23048139-23048767CTGTTGTCCCAGCCCTGTAT AGGTGAGTCCAGGCTAGCAA DMRT2exon1chr23:23055803-23056469 GAGCCCGAGCGGATAATACT ATTAGGACCGCACAGGACACDMRT2exon2 chr23:23056584-23057237 GCGGCTAGGGTGGTACTTCTCTCGTCCTCGTCCTCGTC DMRT2_GAP chr23:23057214-23057971 GAGGACGACGAGGACGAGCCACTTTCAAGGCCTCTCTG DMRT2exon2GAP chr23:23057214-23057971GAGGACGACGAGGACGAG CCACTTTCAAGGCCTCTCTG DMRT2exon3chr23:23059113-23059736 CTGGGGTGACTCTAGCAAGG TCACACCAAGGCAAATTTCADMRT2exon4.1 chr23:23061639-23062293 CCCCCAAAGGGAACTATTTTGAACTGAGGTGGTGGCATTT DMRT2exon4.2 chr23:23062130-23062788TTCAGGGTCTGGGAATATGG TCCAACTTGTTTGGCTACGA DMRT2exon4.3chr23:23062686-23063285 GGCCCCTAAGAAACACAGAG CCTGTAGACCCCAGAGACCAPRIMER_4 chr23:23067103-23067766 GGTCCAAATTGTAGGGCTGATTCCCCAGGAGGTTCTCTTT PRIMER_5 chr23:23069404-23070095CCAGATCAAGGGGAATGCTA CAAGGCAGACCAATCCATTT PRIMER_6chr23:23076510-23077194 CAAAGTAAGCATCCCCAGGA GCAGCACCTCTTTCCTCATCPRIMER_7 chr23:23080154-23080820 TGGAAATTTTGGGCTGTTTCTTTCTCCAGGGAATTTGTGC PRIMER_8 chr23:23085336-23086005GCTGCTGGAGACCAGAAAAG CGAAGGGCACCTATTCAAAA

In Depth Genome Resequencing.

DNA samples from two Icelandic horses, one female mutant DMRT3homozygote and one male control (homozygous wild-type) were prepared forsequencing. Illumina paired-end libraries were generated from these DNAsamples (mean insert sizes of approximately 220 bases). The twolibraries were sequenced (2×100 bp) on seven and five lanes,respectively, using an Illumina HiSeq instrument. The reads were mappedto the horse genome (EquCab2 reference assembly) using the software BWA,and PCR-duplicates were removed using the software Picard(http://picard.sourceforge.net). The average read depth obtained foreach sample was approximately 30×. SNPs and small insertions/deletionswere called from the mapping data after subjecting the alignments torealignment around indels and then variant calling using the GenomeAnalysis Toolkit (GATK). The variant calls were subjected to recommendedVariantFiltrationWalker filters for SNPs listed in the GATK wild page(http://www.broadinstitute.org/gsa/wiki/index.php/The_Genome_Analysis_Toolkit)and read alignments overlapping SNP and insertion/deletion calls withinthe 438 kb Gait locus were then manually reviewed to remove obviousartifact calls. Read depths observed in one kilobase windows were usedto call candidate duplications in the minimum IBD region, and mappingdistances and orientations between paired reads were used to detectstructural variations in relation to the reference assembly. Thesoftware ANNOVAR was used to annotate SNPs in relation to Ensembl genes.

SNP Analysis Using TaqMan Assays.

TaqMan assays were designed to screen the SNPs at chromosome 23,nucleotide position 22,967,656 (BIEC2_(—)620109; the SNP included in theIllumina SNP panel showing the strongest association to the phenotype)and at nucleotide position 22,999,655 (DMRT3.3; the SNP causing apremature Stop codon in DMRT3 exon 2). Custom TaqMan SNP Genotypingassays (Applied Biosystems, Foster City, Calif.) designed for these twoSNPs are summarized in Table 3. Probe and primer designs were obtainedfrom the Applied Biosystems web page(http://www5.appliedbiosystems.com/tools/cadt/) using the customgenotyping assays order option. The ABI PRISM 7900 HT sequence detectionsystem for 384-well format (Applied Biosystems, Foster City, Calif.) wasused for the analysis.

TABLE 3 Description of TaqMan assays for SNPs at nucleotidepositions 22, 967, 656 (BIEC2_620109) and 22, 999, 655 (DMRT3.3) on horse chromosome 23. SEQ ID NO BIEC2_620109Forward Primer Seq. GCAAAGTGCAGAAATAGTCTTTTGGA 26 Reverse Primer Seq.CACTCTTTTGGAATGGTTCACATTAAGG 27 Reference allele* CReporter Sequence (FAM) TAGTGCAAACGGTACGTT 28 Non-reference allele TReporter Sequence (VIC) AAATAGTGCAAACAGTACGTT 29 DMRT3.3Forward Primer Seq. CCTCTCCAGCCGCTCCT 30 Reverse Primer Seq.TCAAAGATGTGCCCGTTGGA 31 Reference allele* C Reporter Sequence (VIC)CTGCCGAAGTTCG 32 Non-reference allele A Reporter Sequence (FAM)CTCTGCCTAAGTTCG 33 *according to the EquCab2 assembly (available atwww.genome.ucsc.edu genome browser)

Genome-Wide Analysis Reveals a Locus on Horse Chromosome 23 Controllingthe Pattern of Locomotion.

Statistical analysis of the SNP-chip data for the 70 Icelandic horseswith a phenotypic classification as four-gaited or five-gaited wascarried using PLINK; 39,695 SNPs passed the quality control. Achi-square test was performed for each marker separately in order totest for a significant difference in genotype frequencies betweenfour-gaited versus five-gaited horses. A genetic model assuming arecessive mode of inheritance was used. Ten thousand permutations wereused to correct for multiple testing. The statistical analysis revealeda highly significant association between a SNP (BIEC2_(—)620109, SEQ IDNO: 5) at nucleotide position 22,967,656 base pair on horse chromosome23 and the gait phenotype (P=0.0002, genome-wide significance; FIG. 1).The two SNPs immediately flanking the highly associated SNP were locatedat nucleotide positions 22,628,976 (BIEC2-619907, SEQ ID NO: 6) and23,315,071 (BIEC2-620244, SEQ ID NO: 7) and these showed only weakassociations to the phenotype (P=0.01 for the SNP at position 22,628,976base pair and P=0.32 for the SNP at position 23,315,071 base pair). Thisresult demonstrated that one or more sequence polymorphisms controllingthe pattern of locomotion is located in the vicinity of the SNP atposition 22,967,656 base pair (the most associated SNP) and within theinterval defined by the flanking markers at positions 22,628,976 and23,315,071 base pairs showing a significantly weaker association to thegait phenotype. This region spans 686 kilo base pairs and five genes arelocated in the interval ANKRD15, DMRT1, DMRT3, DMRT2 and GTF2A2 (FIG.2). This locus was named the Gait locus and the results were consistentwith a recessive inheritance of the allele associated with the abilityto pace, while the wild-type allele (Non-pace) at this locus wasdominant.

Resequencing of Selected Regions Refine the Localization of the GaitLocus.

A number of amplicons (Table 2) from the genomic region harbouring theGait locus as defined by the genome-wide screen (from nucleotideposition 22,628,976 to position 23,315,071 on chromosome 23) wereresequenced in a small set of four-gaited and five-gaited horses inorder to refine the localization of the Gait locus. All the sequencepolymorphisms detected in this analysis are summarized in Table 4. Theresults showed that there is a distinct haplotype associated with therecessive gait allele and that the haplotype block showing a completeassociation to gait in this breed breaks up at nucleotide position22,877,015 just upstream of the DMRT1 gene. The results refine thelocalization of the Gait locus to the interval from nucleotide position22,877,015 base pair to position 23,315,071 base pair; ANKRD15 islocated outside the critical interval for Gait.

TABLE 4 Sequence polymorphisms detected by resequencing amplicons fromthe genomic region harbouring the Gait locus on horse chromosome 23Phenotype Five-gaited Four-gaited Horse Horse SNP Position Horse 1 Horse2 Horse 3 Horse 4 Horse 5 Horse 6 Horse 7 Horse 8 Horse 9 10 11ANKRD15.1 22,793,939 GG GC GC GG GG GG GG GG GG GG GG ANKRD15.222,810,322 GG GA GA GG GA GG GA GG GG GG GA ANKRD15.3 22,812,345 GG GTGT GG GT GG GT GG GG GG GT ANKRD15.4 22,812,251 TT TT TT TT TC TT TC TTTT TT TC ANKRD15.5 22,818,132 TT CT CT TT CT TT CT TT TT TT CT ANKRD15.622,818,158 GG GA GA GG GA GG GA GG GG GG GA ANKRD15.7 22,821,872 CC CACA CC CA CC CA CC CC CC CA ANKRD15.8 22,821,884 GG GG GG GG CG GG CG GGGG GG CG SNP.1 22,868,190 nt nt CC CC nt nt nt nt CC CC CT SNP.222,868,678 nt nt GA AA nt nt nt nt AA AA GA SNP.3 22,872,820 nt nt GG GGnt nt nt nt GG GG GC SNP.4 22,876,848 nt nt CA AA nt nt nt nt AA AA AASNP.5 22,877,015 nt nt TT TT nt nt nt nt TT TT CT BIEC2_62010922,967,656 CC CC CC CT TT TT TT TT TT TT TT DMRT3.1 22,986,593 TT TT TTCT CC CC nt nt CC CC CC DMRT3.2 22,987,143 CC CC CC CT TT TT nt nt TT TTTT DMRT3.3 22,999,655 CC CC CC CA AA AA nt nt AA AA AA DMRT3.422,999,665 GC GG GG GC CC CC nt nt CC CC CC SNP.6 23,009,648 nt nt AA ATnt nt nt nt TT TT TT nt = not tested

A Nonsense Mutation Located in Exon 2 of DMRT3 Shows CompleteConcordance with the Ability to Pace.

The critical interval for the Gait locus comprises the four genes DMRT1,DMRT2, DMRT3 and GTF2A2. The DMRT genes belong to a family oftranscription factors that contains the zinc-finger like DNA binding DMdomain (Murphy et al. 2007. Vertebrate DM domain proteins bind similarDNA sequences and can heterodimerize on DNA. BMC Mot. Biol. 8:58). Wesequenced most of the DMRT exons in this region and identified a smallnumber of sequence polymorphisms (Table 4). One of these (DMRT3.3),located in exon 2 of DMRT3 at nucleotide position 22,999,655, caused anonsense mutation in the allele associated with the ability to pace(FIG. 3). Thus, the gait allele is predicted to encode a truncated formof the DMRT3 protein (SEQ ID NO: 4) lacking the last 174 amino acids,reducing the total size of the protein from 474 to 300 amino acids. Fulllength wild-type horse DMRT3 is shown as SEQ ID NO: 2. An alignment ofthe part of the DMRT3 protein including the mutated amino acid position301 (Serine) in horses shows that this protein is highly conserved amongvertebrates including fish, bird and mammalian species (FIG. 4).

TaqMan assays were designed for the polymorphisms at nucleotidepositions 22,967,656 (the most significantly associated SNP in the GWAanalysis) and at position 22,999,655 (the mutation in DMRT3 creating apremature Stop codon): These were used to screen all 70 Icelandic horsesincluded in this study. Both SNPs showed complete association betweenhomozygosity for the non-reference allele at both, loci and thephenotype (Table 5), the statistical support for an association wasoverwhelming (P=6.73×10⁻¹⁰ for both SNPs, Fisher's Exact Test). Theresults imply that there is very strong linkage disequilibrium betweenthese two SNPs in the studied population, the two SNPs are located 32kilo base pairs apart. Nine animals that were classified as four-gaitedwere homozygous for the haplotype associated with the gait allele (Table5). These animals were either misclassified by their owners, which isfully possible, or the Gait genotype shows incomplete penetrance due tointeraction with environmental factors (for instance training) or otherunknown genetic factors.

We tested 2 North-Swedish Trotters and 61 Swedish Standardbred horses(both used for harness racing in Sweden) to investigate if the gaitallele is present in other horse breeds. We found that both the 2North-Swedish Trotters and 59. Standard bred horses were homozygous forthe DMRT3 nonsense mutation at nucleotide position 22,999,655 on horsechromosome 23 while the remaining 2 Standardbred horses wereheterozygous A/C. The high frequency of this allele in these breedsstrongly suggests that the mutation has a favourable effect on theability to trot at a fast speed. In deed, the two horses identified asbeing heterozygous for the gait allele were also considered as beingpoor trotters. We predict that the gait allele is present at a highfrequency in most, if not all, gaited horse breeds as well as horsesused for harness racing.

TABLE 5 Highly significant association between SNPs at nucleotideposition 22,967,656 (BIEC2-620109) and 22,999,655 (DMRT3.3) on horsechromosome 23 in relation to the phenotypic classification of Icelandichorses as four-gaited or five-gaited. Statistics was calculated usingFisher exact test, with the Gait allele as the recessive allele. OR =odds ratio. MARKER BIEC2-620109 DMRT3.3 Allele 1 (A₁) C C Allele 2 (A₂)T A Wild-type A₁/— 21 21 A₂/A₂ 9 9 Five-gaited A₁/— 1 1 A₂/A₂ 39 39 p6.73E−10 6.73E−10 OR 83.18 83.18 A₁/— = A₁/A₁ or A₁/A₂

TABLE 6 Genotype distribution for a nonsense mutation (A) in DMRT3 amonghorse populations. Breed Number CC CA AA Icelandic Horse 70 0.01 0.300.69 Standardbred Trotter 61 0.00 0.03 0.97 Cold Blooded Trotter 2 0.000.00 1.00

TABLE 7 SNP sequences SEQ ID. NO sequence SNP position SEQ ID NO: 5TTGTTGGGGTCTTATGCAAAGTGCAGAAATAGTCTTTTGGA BIEC2_620109 22 967 656AAAACGTAC[C/T]GTTTGCACTATTTTCTTATTTCTATTCACC CTTAATGTGAACCATTCCAASEQ ID NO: 6 AGAAATGATATATAAAAATTACGAATGCCTCTTAGACAGAAT BIEC2-61990722 628 976 CCTTATGT[A/G]TGGCACAGAAGTATTTAGTTCGCTTAACAGATATTGAGTGCTTATATGAG SEQ ID NO: 7CTCTTCCTTGCATCCTATCCCCCTAGTGTCGCAAGGGAAGT BIEC2-620244 23 315 071TGTGAGAGA[C/T]GAGCTTGTAGATCTGCTCTAGAAAATAG GCCTGTTTTCTTAAGAAACCGTSEQ ID NO: 8 CAGAGTGCCGGTCTGTGGCTGTGGGCGCTGACGAGCACA ANKRD15.122 793 939 TGGACAACATT[G/C]TCGTGTACCACAGGGGCTCCAGGTCCTGTAAGGATGCTGCTGTGGGGACA SEQ ID NO: 9AGAACTCATTCAAAACCACCAGGCTTACTAGGCTTTTTTAA ANKRD15.2 22 810 322ATAGACTTG[A/G]CTTTGAACTTCTAAGTGCAGGATCTAAAA CCACTGGCGAAATTTCTGGAASEQ ID NO: 10 TTACCTGCATGCCTCTCCCCCTAAACCATTTCTAGCATGTG ANKRD15.322 812 345 TGGGCAGAG[T/G]GGGCATCGTGCTGCCCTGCTCACTGGATCACTCTGGGAACGTTTCCTTCA SEQ ID NO: 11AAGGATATGGTGAGTCTGACCTACAGACACTGTCCCCGGT ANKRD15.4 22 812 251CTGTACAAAG[T/C]GCCCAAGTGGTGACAAAGCATCCCTCG CCTGCCCCCTGAGCTGTTACCTGSEQ ID NO: 12 AACGCCAAAGCCAGCCAGGTGACTGCGCTTGCTTCCTGGG ANKRD15.522 818 132 CTCATGCTCA[C/T]ACTGCTGTGACCCGCACAGGTGCCCACGCCACACTTCCCACCGCTCGGCA SEQ ID NO: 13GCTTGCTTCCTGGGCTCATGCTCACACTGCTGTGACCCGC ANKRD15.6 22 818 158ACAGGTGCCC[A/G]CGCCACACTTCCCACCGCTCGGCACT CACTCATGGCCCAGCCCCGAGTCCSEQ ID NO: 14 ACTGAATGTATACATTTTGTGCCTGAACTCACCAGCAAACA ANKRD15.722 821 872 GAAGGCAGA[A/C]AACCAAGGGTTGAAGGCTGGAGCTGTCACAGTAGAAGTTGAGCCAGCAGG SEQ ID NO: 15CATTTTGTGCCTGAACTCACCAGCAAACAGAAGGCAGAAA ANKRD15.8 22 821 884ACCAAGGGTT[G/C]AAGGCTGGAGCTGTCACAGTAGAAGTT GAGCCAGCAGGAATTTGCTGGCCSEQ ID NO: 16 TCACTCTAATCAAGTTGCTATCACCATTCACACAATTGTCCA SNP.122 868 190 GGATAGTA[C/T]TGGGACCCCAGAAAGATCACGCCGCTCCATTCCCATTTCCCACTTGTTCC SEQ ID NO: 17CTGGGCTGAAACAGGTGGTCCTGCTTTCCCCGCCTGCCTG SNP.2 22 868 678GTCAGGCTGC[G/A]CTCTTCTCCCCTCCCCAGGCTTAAGTC ACTTCATGCAGAACCCTTTATACSEQ ID NO: 18 CCAGCATTCTCCGCTTTCAACTTTCTCCCGCTCCTCCAATC SNP.3 22 872 820CAAACTGGA[G/C]TTAGCATCAGCTACCCACAATGATCAAG CATTTTCTGTGTGGCAGGCCTGSEQ ID NO: 19 AGGCAAGAAGCGATAGGCAAAAGGCAGCAAGAGCTGGAC SNP.4 22 876 848CTGCAGATTTG[C/A]AAGTTCTCTGGAGCCAGTAGGTGGAA ACCTCATCAGCAAATGAACGCAGGSEQ ID NO: 20 CCACACTGAGAGTCTTATTTGCTGATAGAAATGCAGAGACT SNP.5 22 877 015TCTCTTTTC[T/C]GAGGCTTTCAACCTCGTACTTAATTCTCCT AAGTGAGAAAGAAACCACTCSEQ ID NO: 21 ACCAGCGGGAGACTGAGGCTGCGAGCGCCGCAAAGACGG DMRT3.1 22 986 593GTGCCGCATCT[C/T]TGGCCAGCCCGGAGCGCACGCGGCC GCCGGAGCTGCGGGACCAAGGACCGSEQ ID NO: 22 CCGTCTCAGCCGCCGCCGCCGCAGCGTCCCGCCGCCGAG DMRT3.2 22 987 143TTGGCTGCGGC[C/T]GCCGCGCTGCGCTGGGCCACCGAGC CGCAGCCCGGGGCGCTGCAGGCGCASEQ ID NO: 23 GGAGGTCCTCCTCTCCAGCCGCTCCTCGGCCTCGGCCGC DMRT3.3 22 999 655CGACCGAACTT[C/A]GGCAGAGCCCGAGAGCCTCGTGTTG CCCTCCAACGGGCACATCTTTGAACSEQ ID NO: 24 CTCTCCAGCCGCTCCTCGGCCTCGGCCGCCGACCGAACT DMRT3.4 22 999 665TCGGCAGAGCC[C/G]GAGAGCCTCGTGTTGCCCTCCAACG GGCACATCTTTGAACACACCTTGAGSEQ ID NO: 25 GGCCTGGCCCCTAGGGCATTGAAGGGCTGGGGAGAGTCA SNP.6 23 009 648CATGTACTCCC[A/T]CTGTGGCCTGAAGACCTACCTGGAGG GAAACCAGCTTGCTTAGGGGGCCT

TABLE 8 Sequence variants on horse chromosome 23 showing strong geneticassociation with the Gait mutation in horses. The Gait mutation occurson horse chromosome 23, nucleotide position 22,999.655 bp and isindicated in bold italics below. Location/ Coordinate Ref. Var. Typeconsequence¹ (EquCab2) Allele² allele(s)³ SNP intronic 22919878 A G SNPintronic 22920361 C T SNP intronic 22920434 A T SNP intronic 22920646 GA SNP intronic 22920717 C T SNP intronic 22921203 G T SNP intronic22922079 A G SNP intronic 22922780 C T SNP intronic 22923569 A G SNPintronic 22924120 G A INDEL intronic 22924142 — A SNP intronic 22924299T G SNP intronic 22924380 A G SNP intronic 22924407 C T SNP intronic22926098 C T SNP intronic 22926188 T C SNP intronic 22926872 A C SNPintronic 22927387 C T SNP intronic 22927607 T C SNP intronic 22928220 CT SNP intronic 22928537 T G SNP intronic 22928587 A G SNP intronic22929137 G A SNP intronic 22930011 A C SNP intronic 22932024 G A SNPintronic 22932895 A G SNP intronic 22933218 A G⁴ SNP intronic 22936034 AG SNP intronic 22940759 T G SNP intronic 22942423 T A SNP intronic22945643 G C SNP intronic 22946599 A T SNP intronic 22948774 C T SNPintronic 22949055 A G SNP intronic 22949108 A G SNP intronic 22949240 TC SNP intronic 22949710 A G SNP intronic 22956846 G T SNP intronic22960132 A C SNP intronic 22960528 T C SNP intronic 22960710 C T SNPintronic 22964042 C T INDEL intronic 22965059 — GA SNP intronic 22967119C T SNP intronic 22967656 C T SNP intronic 22967915 G C SNP intronic22968898 G A SNP intronic 22973984 C T SNP intronic 22974589 T C SNPintergenic 22979124 T C SNP intergenic 22980014 C T SNP intergenic22982879 T C INDELs intergenic 22984588 A — INDEL intergenic 22985746 G— SNP intronic 22988210 C A SNP intronic 22988991 T G SNP intronic22993092 C A SNP intronic 22994591 C A SNP intronic 22999058 G A

SNP intergenic 23002606 A G INDEL intergenic 23003956 — TG SNPintergenic 23008772 G A SNP intergenic 23008789 G A SNP intergenic23009648 A T SNP intergenic 23010164 G A SNP intergenic 23011289 G C¹Location: Indicates where the SNP is located in relation to Ensemblgenes. In cases of coding sequence overlap, the predicted consequence tothe protein is indicated. The gene intersection was performed using thesoftware ANNOVAR. ²Ref. allele. This is the reference allele in thehorse genome assembly (EquCab2). ³Var. allele: This is the variantallele at polymorphic position showing very strong association with theGait mutation. For insertion polymorphisms in relation to the referenceassembly (EquCab2), the reference allele is denoted “—” and fordeletions in relation to the reference the variant allele is denoted“—”. The sequenced mutant horse was homozygous for the variant allele atall sites except one (see Footnote 4) listed in this table unlessotherwise stated in the Var. allele column. ⁴This SNP was identified asheterozygous (AG) in the mutant horse and homozygous for the referenceallele in the control horse. The G-allele at this SNP has likelyoccurred subsequent to the DMRT3 nonsense mutation.

Electrophoretic Mobility Shift Assays (EMSA).

The oligonucleotide 5′-ggatccTCGAGAACAATGTAACAATTTCGCCC-3′ and itscomplementary sequence were annealed in 10 mM Tris pH 7.5, 1 mM EDTA, 50mM KCl by firstly heating to 95° C. for 2 min and thereafter cooled to25° C. (2 min/degree). The duplex was labelled with Klenow DNApolymerase and [α-32P]-dCTP and purified using a Bio-Rad Micro Bio-Spin30 column. DMRT3 wild type and mutant protein were produced by invitro-translation using a TNT Quick Coupled Transcription/TranslationSystem (Promega). EMSA was performed as described by Culbertson & Leeds,2003 (Looking at mRNA decay pathways through the window of molecularevolution. Curr. Opin. Genet. Dev. 13, 207-214) with the followingmodifications. No plasmid DNA was added and 1.0 μl in vitro-translatedprotein and 150× cold competitor were used. The reaction mixture wasincubated on ice for 20 min before adding the radioactive oligo andthereafter incubated at room temperature for 30 min. Gels were run at150 V in room temperature. Both full-length wild-type and mutant DMRT3protein were found to bind a previously defined DMRT-binding motif (FIG.6). Thus, the DMRT3 mutation does not lead to an altered expressionpattern and the mutant protein appears to maintain its cellularlocalization and DNA binding profile. It may therefore be a dominantnegative form with normal DNA-binding but defective interaction withother proteins. This would be consistent with the clear phenotypiceffects observed in heterozygotes. However, the mutation is not fullydominant as CA heterozygotes and AA homozygotes show distinct phenotypicdifferences.

CONCLUSIONS

We have presented abundant evidence that the DMRT3_Ser301STOP mutationhas a major effect on gaits in horses. Our interpretation of thephenotypic consequences of this mutation is that homozygosity for themutation is required but not sufficient for pacing, as many StandardbredTrotters and some Icelandic horses that are homozygous mutant do notpace. On the other hand heterozygosity or homozygosity for the mutationare permissive to enable a variety of four-beat ambling gaits to beperformed, with genetic modifiers that may be unique to each gaitedbreed. The mutation promotes ambling gaits and pace and it inhibits thetransition from trot or pace to gallop, which explains its highfrequency in pacers and trotters used for harness racing. It is an openquestion if the mutation alters the fate of DMRT3-neurons or changestheir transcriptional regulation, but it is clear that these neuronsmust have a key role for the control centre in the spinal cordcoordinating limb movements.

All of the compositions and/or methods disclosed and claimed herein canbe made and executed without undue experimentation in light of thepresent disclosure. While the compositions and methods of this inventionhave been described in terms of preferred embodiments, it will beapparent to those of skill in the art that variations may be applied tothe compositions and/or methods and in the steps or in the sequence ofsteps of the method described herein without departing from the concept,spirit and scope of the invention. More specifically, it will beapparent that certain agents which are both chemically andphysiologically related may be substituted for the agents describedherein while the same or similar results would be achieved. All suchsimilar substitutes and modifications apparent to those skilled in theart are deemed to be within the spirit, scope and concept of theinvention as defined by the appended claims.

1. A method for predicting the pattern of locomotion in a horseincluding the ability to use alternative gaits, to trot or pace at afast speed, and to perform in dressage, said method comprising steps of;i) extracting DNA from a sample obtained from a horse, ii) determiningin said DNA the presence or absence of at least one genetic marker,wherein said at least one genetic marker is located in the regionbetween the flanking SNPs at nucleotide positions 22,628,976 and23,315,071 base pairs on horse chromosome
 23. 2. The method according toclaim 1, wherein said at least one genetic marker is located in theregion between the flanking SNPs at nucleotide positions 22,919,878 and23,011,289 base pairs on horse chromosome
 23. 3. The method according toclaim 1, wherein the genetic marker is selected from the genetic markerslisted in Table 4, Table 5, Table 7, and Table
 8. 4. The methodaccording to claim 2, wherein the genetic marker is selected from thegenetic markers listed in Table
 8. 5. The method according to claim 2comprising identifying in said DNA the nucleotide in one or morespecific position selected from the positions 22,919,878; 22,920,361;22,920,434; 22,920,646; 22,920,717; 22,921,203; 22,922,079;22,922,780;22,923,569; 22,924,120; 22,924,142; 22,924,299; 22,924,380;22,924,407; 22,926,098; 22,926,188; 22,926,872; 22,927,387; 22,927,607;22,928,220; 22,928,537; 22,928,587; 22,929,137; 22,930,011; 22,932,024;22,932,895; 22,933,218; 22,936,034; 22,940,759; 22,942,423; 22,945,643;22,946,599; 22,948,774; 22,949,055; 22,949,108; 22,949,240; 22,949,710;22,956,846; 22,960,132; 22,960,528; 22,960,710; 22,964,042; 22,965,059;22,967,119; 22,967,656; 22,967,915; 22,968,898; 22,973,984; 22,974,589;22,979,124; 22,980,014; 22,982,879; 22,984,588; 22,985,746; 22,988,210;22,988,991; 22,993,092; 22,994,591; 22,999,058; 22,999,655; 23,002,606;23,003,956; 23,008,772; 23,008,789; 23,009,648; 23,010,164; and23,011,289, on horse chromosome
 23. 6. The method according to claim 1comprising determining in said DNA the presence or absence of: i) thenucleotide C in a nucleotide position corresponding to position 939 inSEQ ID NO: 1, ii) the nucleotide A in a nucleotide positioncorresponding to position 939 in SEQ ID NO: 3, iii) the nucleotide Cand/or T in a nucleotide position corresponding to position 51 in SEQ IDNO: 5, iv) the nucleotide A and/or G in a nucleotide positioncorresponding to position 51 in SEQ ID NO: 6, v) the nucleotide C and/orT in a nucleotide position corresponding to position 51 in SEQ ID NO: 7,vi) the nucleotide G and/or C in a nucleotide position corresponding toposition 51 in SEQ ID NO: 8, vii) the nucleotide A and/or G in anucleotide position corresponding to position 51 in SEQ ID NO: 9, viii)the nucleotide T and/or G in a nucleotide position corresponding toposition 51 in SEQ ID NO: 10, ix) the nucleotide T and/or C in anucleotide position corresponding to position 51 in SEQ ID NO: 11, x)the nucleotide C and/or T in a nucleotide position corresponding toposition 51 in SEQ ID NO: 12, xi) the nucleotide A and/or G in anucleotide position corresponding to position 51 in SEQ ID NO: 13, xii)the nucleotide A and/or C in a nucleotide position corresponding toposition 51 in SEQ ID NO: 14 xiii) the nucleotide G and/or C in anucleotide position corresponding to position 51 in SEQ ID NO: 15, xiv)the nucleotide C and/or T in a nucleotide position corresponding toposition 51 in SEQ ID NO: 16, xv) the nucleotide G and/or A in anucleotide position corresponding to position 51 in SEQ ID NO: 17, xvi)the nucleotide G and/or C in a nucleotide position corresponding toposition 51 in SEQ ID NO: 18, xvii) the nucleotide C and/or A in anucleotide position corresponding to position 51 in SEQ ID NO: 19,xviii) the nucleotide T and/or C in a nucleotide position correspondingto position 51 in SEQ ID NO: 20, xix) the nucleotide C and/or T in anucleotide position corresponding to position 51 in SEQ ID NO: 21, xx)the nucleotide C and/or T in a nucleotide position corresponding toposition 51 in SEQ ID NO: 22, xxi) the nucleotide C and/or A in anucleotide position corresponding to position 51 in SEQ ID NO: 23, xxii)the nucleotide C and/or G in a nucleotide position corresponding toposition 51 in SEQ ID NO: 24, and/or xxiii) the nucleotide A and/or T ina nucleotide position corresponding to position 51 in SEQ ID NO:
 25. 7.The method according to claim 4 comprising determining in said DNA thepresence or absence of: i) the nucleotide C in a nucleotide positioncorresponding to position 939 in SEQ ID NO: 1, and/or ii) the nucleotideA in a nucleotide position corresponding to position 939 in SEQ ID NO:3.
 8. A method for predicting the pattern of locomotion in a horseincluding the ability to use alternative gaits, to trot or pace at afast speed, and to perform in dressage, said method comprising steps of;i) extracting protein from a sample obtained from a horse, and ii)determining in said protein sample the presence or absence of atruncated form of the DMRT3 protein.
 9. The use of the method accordingto any of claims 1-8 for paternity testing.
 10. The use of the methodaccording to any of claims 1-8 for selection a horse for breeding,
 11. Amethod for selection a horse for breeding, said method comprisingdetermining in a DNA sample obtained from said horse the allele of atleast one genetic marker, wherein said at least one genetic marker islocated in the region between the flanking SNPs at nucleotide positions22,628,976 on horse chromosome 23.